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The  following  corrections  apply  to  AFCRL  Research 
Report  62-549,  entitled  "A  Phrase-Structure  Language  Trans¬ 
lator,"  and  dated  August  1962. 


Page  3 

Line  18,  change  'tract'  to  'track' 

Line  27,  change  to  read  'character^?  (sometimes  written  as  P 

herein)  is  used  for  thi^urpose. ' 

Line  28,  change  'PnJ'  to  [^nj' 

Line  31,  change  'P  '  to  ' 

“  n  n 


Page  4 

Lines  1  through  4,  change  'P'  to  'Sp' 
Line  15,  change  'P'  to'^' 


Pa^e  7 

Line  4,  change  '  P,  P  to  'CP ,  3'  ' 

Lines  21,  22,  28,  29,  and  30,  change  'P'  to  'CP' 

Page  8 

Lines  1,  2,  4,  5,  8,  and  14,  change  'Pll',  'P2l',  and  'Pn'  to 
'1^21,  andi^n' 

Page  11 

Lines  6,  7,  and  8,  delete  all  between  'and'  of  Line  6  and  'go'  of 
Line  8  (sentence  will  then  read  'If  STABS (i)f4  GOAL  we  set 
SF  to  F  and  go  to  NEWS4.') 

Page  12 

Line  15,  change  'F^'  to 

Line  19,  change  '  and  Fg'  to  and  5^2* 

Line  26,  change  'a  P  or  F'  to  'aCP  ovP' ' 

Line  33,  change  to  'if  z-CP  we  go  to  WASSP' 

Line  34,  change  to  'if  an$^  we  go  to  WASSF' 

Page  13  ^ 

Lines  31,  34,  and  35,  change  'P'  and  'P  \otP'  and  ' 

Page  22 

Line  12,  change  'EROES'  to  'ERORS' 

Page  34 

Line  23,  change  to  read  'where  n  is  either  1,  2,  or  3  ' 
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Abstract 


The  author  discusses  a  syntax-directed  translator  with  several  new  features 
which  result  in  much  faster  running  time  and  the  ability  to  handle  longer  input 
strings.  The  computer  program  for  the  translator  is  described  in  detail,  and  an 
example  of  its  operation  is  provided. 
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A  Phrase-Structure 
Language  Translator 


1  INTRODUCTION 

Approximately  five  months  ago  many  articles  began  appearing  in  computer 
journals  regarding  work  being  done  on  compilers  which  use  a  syntactical  descrip¬ 
tion  to  direct  the  translation  of  an  input  language  (phrase-structure)  to  the  output 
language.  Although  various  people  use  somewhat  different  methods,  the  translation 
processes  are  all  relatively  short  compared  to  previous  compiler-translators,  are 
elegant  in  form,  and  can  easily  be  programmed  to  accept  changes  in  the  input 
language.  On  the  other  hand,  they  tend  to  be  much  slower  than  their  specialized 
ancestors  and  consume  large  amounts  of  storage  during  operation. 

In  order  to  study  these  new  techniques  one  of  them  was  programmed.  While 
they  were  all  quite  similar,  the  methods  of  E.  T.  Irons  were  chosen  as  a  starting 
point  for  two  reasons: 

a.  His  notation  for  specifying  the  translation  process  is  compact  and 
easily  handled. 

b.  The  linked  list  which  is  built  up  during  the  parsing  of  the  input  string 
and  which  contains  all  information  for  the  final  output  string  seems  to 
be  easier  to  work  witn  than  other  systems. 

The  two  basic  programs,  DIAGRAM  and  TRANSYM,  took  only  three  weeks  from 
start  of  writing  to  production  runs.  These  will  be  described  in  detail  in  this  paper. 


(Received  for  publication,  14  June  1962) 
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The  notation  of  Mr.  Irons  was  used  wherever  possible  to  clarify  references  to 
his  works.  The  program  TRANSYM  is  very  much  like  the  program  of  Mr.  Irons, 
and  is  fairly  straightforward.  DIAGRAM  has  been  changed  considerably,  however, 
so  it  now  bears  only  a  general  resemblance  to  its  progenitor.  It  was  DIAGRAM 
(and  programs  of  similar  nature  in  other  systems)  which  was  eating  up  so  much 
time  and  storage.  The  program  DIAGRAM  described  herein  operates  from  100  to 
500  times  faster  than  any  others  we  know  of.  It  uses  about  40%  less  storage  for 
its  output  string  and  about  90%  less  storage  for  its  syntactical  tables.  These 
programs  have  been  enlarged  into  a  system  which  is  being  used  regularly  now, 
and  it  is  this  system  that  is  described  in  this  report. 


2  GENERAL  STATEMENT  OF  PURPOSE 

The  system  was  designed  in  the  usual  monitor  fashion;  i.e. ,  several  different 
translations  can  be  done  in  one  run  on  the  computer.  Each  translation  will  ordi¬ 
narily  consist  of  the  following  parts: 

(1)  Specifying  the  allowable  syntactical  units. 

(2)  Syntactical  description  of  the  language;  i.e.,  how  the  'small'  syntactical 
units  fit  together  to  form  'big'  units. 

(3)  Specifying  whence  the  input  is  to  come  and  how  it  is  to  be  converted  into 
the  input  string. 

(4)  Performing  the  translation  process  with  parameters  stating: 

a.  the  'goal';  i.e.,  the  syntactical  unit  the  input  string  is  expected  to 
form. 

b.  what  output  converter  is  to  be  used  and  where  the  output  is  to  go. 

The  translation  process  produces  an  output  string  which  is  put  out 
in  its  final  form  (printed,  punched,  written  on  tape,  etc.)  by  the 
output  converter. 

A  syntactical  unit  can  be  either  the  name  of  a  single  unit  in  the  input  string  or 
a  name  assigned  to  a  group  of  input  units.  Thus,  we  might  call  an  'A'  simply 
'A'  (although  we  could  as  easily  call  it  a  'B');  bigger  units  could  be,  say,  'word,' 
'number,  '  'sentence,  '  'sum,  '  ele.  The  purpose  of  Part  (1)  is  to  assign  a  number 
to  each  of  these  units.  Currently  these  ai’c  numbered  000-777  (octal)  giving  a 
maximum  of  512  distinct  units.  Only  125  of  these  have  been  used  to  date,  so  this 
appears  to  be  a  reasonable  selection.  When  the  syntactical  descriptions  are  read 
in,  they  are  ti-anslated  into  strings  of  these  numbers.  Digits,  letters,  and  punctu¬ 
ation  characters  should  be  assigned  numbers  corresponding  to  their  input  repre¬ 
sentation  in  order  to  facilitate  input  conversion  (in  the  IBM  7090  this  means  an 
'A'  is  assigned  021,  etc.).  Part  (3)  is  dependent  upon  the  individual  application, 
although  it  usually  takes  the  form  of  a  character-to-eharaeter  conversion  with  a 
pi'ovision  for  dropping  extra  blanks.  PaiMs  (2)  and  (4)  occupy  most  of  the  progi'am 
space,  and  these  will  be  dealt  with  in  separate  sections. 
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3  SYNTACTICAL  DESCRIPTION  OF  THE  LANGUAGE 

In  this  section  the  basic  form  of  the  syntax  descriptions  will  be  given.  The 
form  presently  used  is  actually  somewhat  different  from  this  in  order  to  save 
space  in  the  output  of  DIAGRAM,  and  to  allow  special  interruptions  for  debugging 
and  other  uses.  The  basic  form  is  essentially  that  of  Irons,  and  his  notation  will 
be  used. 

The  syntactical  description  consists  of  a  sequence  of  'sentences, '  each  having 
the  form: 


S  S  ,S  ,....S,S  =::S  ==^  { P  P  P  P  .  .  .  .  P  p) 

m  m-1  m-2  2  1 


components 


subiect 


definition 


Each  S.  is  a  'component,  '  a  syntactical  unit;  the  unit  S  following  the  =  :  : 

is  called  the  'subject'  of  the  sentence.  The  meaning  associated  with  the  part  of 

the  sentence  before  the  —  "j>  is  that  if  a  section  of  the  input  string  is  of  the 

form  S  S  ,  .  .  .  S.  ,  then  that  section  can  be  called  an  'S'.  The  string 
m  m-1  1  ^ 

{  PPP...PP)is  the  'definition'  of  the  sentence;  it  specifies  the  output  for 
S  in  terms  of  output  from  the  components.  Usually  each  P  will  be  a  character 
of  the  output  string  language;  however,  certain  characters  have  special  signifi¬ 
cance.  These  are  {  }  F  '3^  j  ]]  - —  . 

The  units  {  and  )  will  occur  only  when  syntactical  descriptions  are  to  be 
put  out.  A  counter  in  the  output  string  generator  keeps  tract  of  the  number  of  {  's 
minus  the  number  of  j  's.  Let  us  call  this  number  b  .  When  a  {  is  encoun¬ 
tered,  a  {  is  output  and  b  +  1  replaces b.  When  a  }  is  encountered,  b  is 
replaced  by  b  -  1  if  the  latter  is  not  negative  and  a  )  is  output.  If  b-  1  is 
negative  (  b  =  0  before),  then  the  output  string  generator  TRANSYM  has  reached 
the  end  of  that  definition  string  and  nothing  is  output.  TRANSYM  usually  begins  its 
action  with  the  character  following  the  leftmost  {  in  a  definition  string  with  b  =  0. 

Most  definitions  for  a  subject  require  information  from  the  definitions  of  its 
components,  sometimes  with  substitutions  being  made  in  these  definitions.  The 
character  P  is  used  for  this  purpose.  Assuming  that  the  counter  b  is  0,  the 
characters  Pn  11,  where  n  denotes  an  octal  interger,  signal  TRANSYM  to  obtain 
and  output  the  definition  corresponding  to  (counting  left  n  from  the  =  :  If 

substitutions  are  to  be  made  in  the  definition  for  S  ,  then  the  form  is 

n 

p  ;  X ,  —  pp  . . .  p  '  X,  —  pp  . . .  p  ;  . . .  ;  x  —  pp  . . .  p  u 

n  ’  1  ’2  ’  ’  i 

signifying  that  if  an  x.  is  encountered  it  will  be  replaced  with  what  is  to  the  right 
of  the  ^  up  to  the  next  j  or  ]]  .  The  strings  P  P  .  .  .  P  are  in  each  case 
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completely  general,  and  may  contain  more  P's,  etc.  If  b  =  bi  0  and  the  char- 

/  / 

acter  P  is  preceded  by  b,  's,  then  the  's  are  ignored  and  the  operation  pro- 

^  II 

ceeds  as  stated  above.  If  P  is  preceded  by  either  no  ^s  or  by  s,  where 

i=  ,  then  the  ^'s,  the  P,  and  all  the  characters  up  to  and  including  the  I 
are  output  as  is.  This  feature,  in  combination  with  the  {  's  and  }  's,  enables 
syntactical  descriptions  to  be  output. 

Occasionally  it  is  useful  or  necessary  to  perform  some  special  task  in  the  out¬ 
put,  such  as  outputting  the  sum  of  several  numbers,  or  counting  the  occurrences  of 
a  certain  character  in  a  definition,  etc.  The  character  4’  is  used  to  denote  a 
special  function  of  this  type.  Assuming  that  b  =  0,  the  characters 


m  groups 


cause  TRANSYM  to  call  the  function  of  m  arguments  and  no  arguments, 
respectively.  These  special  functions  must  be  coded  separately  and  put  into  storage 
when  the  other  programs  are  read  in  at  the  beginning  of  the  run.  If  b  0,  the 
use  of  ^'s  is  exactly  the  same  as  with  the  character  P. 


4  TABLING  THE  SYNTACTICAL  DESCRIPTION 


Consider  the  following  descriptions: 


«1 

“2  “3 

"4 

0:5  =:: 

"6 

{. 

.  .} 

"2  “8 

"9 

«10  =  :  : 

“11 

{. 

.  .} 

“1 

“2  “8 

"9 

"10=  - 

“l3 

=— >  {. 

•  .} 

“1 

“15  ^ 

->{■ 

.  .} 

No  information  is  lost  and  constructional  differences  are  made  more  apparent  if  we 
change  this  to: 


"i“2?3"4"5"6 


“8  "9  “10  ?'ll 


13 


I 

"l5  "16 


(. 

f 

\  • 

{. 


.  .) 
.  .) 
.  .} 


{.  .  .} 
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The  items  to  notice  are  the  following: 

a.  The  characters  =  :  :  and  >  can  be  omitted  if  we  agree  that  the  charac¬ 
ter  {  can  never  occur  as  one  of  the  a's.  Thus  the  subject  is  the  a  before  the 
first  {  and  the  definition  follows  that  { 

b.  The  method  of  examination  of  the  input  string  is  made  much  more  evident. 
To  elaborate  on  part  (b)  let  us  table  the  above  information  as  follows: 


STC  (i) 

STABS  (i) 

t 

STABD(t) 

a  TRAN(a) 

2 

15 

^2 

1 

} 

«1  2 

3 

8 

Xj 

1  1 

4 

0 

^4 

1 

} 

5 

0 

^5 

^2 

1 

6 

0 

1 

i 

7 

h 

0 

{ 

^3 

1 

1 

) 

8 

0 

^8 

^4 

1 

9 

0 

X9 

} 

10 

0 

^10 

1 

11 

13 

^11 

12 

CSl 

{ 

13 

0 

^13 

14 

^3 

{ 

15 

0 

^15 

16 

0 

^16 

17 

^4 

( 

Let  us  assume  that  the  unit  has  been  constructed  from  the  input.  The 
purpose  of  the  TRAN  table  is  to  index  the  STABS  table,  which  contains  the  syntax 
units  of  the  descriptions.  Thus,  if  we  are  to  construct  something  which  begins  with 
an  ,  TRAN(a^)  =  2  gives  us  the  starting  point  in  STABS  and  STC  for  the  de¬ 
scriptions  which  begin  with  a ^  .  STABS(2)  gives  the  next  unit  to  be  constructed 
from  the  input.  Since  STABS(2)  =  “2  not  a  subject)  DIAGRAM  will 

attempt  to  build  an  from  the  input.  If  cannot  be  constructed,  if  a^,  , 

or  Og  cannot  be  constructed,  of  if  does  not  lead  to  the  desired  final  form. 
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then  DIAGRAM  will  attempt  to  construct  STABS  (STC(  2)  )  =  STABS(15)  =  ot . 

If  the  input  string  does  yield  the  construction  ,  then  DIAGRAM  will 

recognize  that  it  has  constructed  an  and  note  in  an  output  string  that  the  cor¬ 
responding  definition  begins  in  STABD(STC(6  +  1)  )  =  STABD(  If  is  not 

the  end  result  and  further  construction  is  needed,  DIAGRAM  will  begin  building  the 
unit  found  in  STABS  (TRAN  (a^)  ).  Thus  we  see  that  the  syntax  descriptions  are 
broken  up  into  groups;  every  sentence  in  a  given  group  begins  with  the  same 
syntactical  unit.  TRAN  of  this  unit  refers  to  the  location  in  STABS  where  the  rest 
of  the  first  sentence  of  the  group  begins;  STC  provides  a  linkage  between  sentences 
within  a  group.  STABD  may  be  regarded  as  an  extension  of  STABS,  and  in  fact  it 
could  be  included  in  STABS  (Irons  had  them  in  a  combined  table),  but  since  they 
need  not  be  in  storage  at  the  same  time  (DIAGRAM  is  all  through  with  STABS  be¬ 
fore  TRANSYM  begins  on  STABD)  it  was  considered  best  to  separate  them  in  case 
storage  was  needed  later. 

Consider  the  following  sentences: 


^3 

X9 


"^6 


"^6 


X 

X 

X 

X 


4 

3 

4 

9 


X 


1 


On  occasion  it  is  necessary  for  DIAGRAM  to  know  if  a  unit  x^  can  be  con¬ 
structed  beginning  with  the  unit  x^  .  This  information  is  contained  in  a  Boolean 
table  called  the  SUCCR  (successor)  table.  The  SUCCR  table  for  the  above  sen¬ 
tences  would  be 

xX^  X.  X,  X,  X,  X,  X/  X_  Xo  Xn 

l\  1  Z  3  4  D  0  I  o  i 


0 

1 

1 

0 

0 

0 

0 

0 

1 


0 

0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 

0 


0 

1 

1 

0 

0 

0 

0 

0 

0 
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and  we  see  that  SUCCR(1,4)  =  1;  SUCCR(1,5)  =  0,  SUCCR(2,  3)  =  1;  SUCCR(5J)=0 
for  all  j,  etc. 

The  table  S  indexes  the  table  B  which  contains  the  substitutions  in  effect  in 
TRANSYM.  Before  outputting  an  ordinary  character  x  (not  a  P,  F,  etc.) 
TRANSYM  first  checks  to  see  if  x  is  entered  in  S;  if  not,  then  x  is  output;  if 
X  is  in  S,  then  S(u)  gives  the  location  in  B  of  the  string  to  be  output  instead  of 
X.  S  and  B  will  be  described  in  detail  later. 


5  OTP 

OTP  is  the  name  of  the  output  from  DIAGRAM.  It  is  also  the  input  to 
TRANSYM.  OTP  is  a  list  containing  all  the  information  of  the  structure  of  the  in¬ 
put  string  according  to  the  syntactical  descriptions  in  STABS.  Its  form  determines 
the  operation  of  both  DL4GRAM  and  TRANSYM.  For  this  reason  it  is  essential  that 
the  reader  understand  OTP  completely  before  continuing. 

Entries  in  OTP  can  be  of  two  types:  direct  and  indirect.  The  indirect  entries 
will  be  preceded  by  a  —  to  distinguish  between  the  two.  Each  direct  entry  is  the 
address  of  a  definition  in  STABD,  that  is,  it  is  a  /?.  .  Each  indirect  entry  is  the 
address  of  a  (direct)  entry  in  OTP.  Suppose  we  have  the  descriptions: 


a  =  : 

:  letter  = 

{  A  } 

b  =  : 

:  letter  = 

{  B  } 

c  =  : 

letter  = 

{ c  } 

letter  =  : 

Itrgrp  = 

{PI]]) 

Itrgrp  letter  =  : 

Itrgrp  = 

(  P  2]]  PI]]} 

If  we  give  DIAGRAM  the  input  string 

a  b  c 

and  ask  it  to  construct  a  Itrgrp  (letter  group),  then  OTP  would  be 


1. 

P 

Z 

- ^ 

B  I 

2. 

^^3 

- > 

C  } 

3. 

P  2]]  PI]]) 

4. 

-2 

5. 

- ^ 

P  2  ]]  PI]]} 

6. 

-1 

7. 

P  1  D) 

8. 

- > 

A  } 

8 


TRANSYM  would  be  told  to  start  with  OTP(3).  To  find  P2l|  we  look  in  OTP(3  +  Z), 
where  another  P  2  ]]  sends  us  to  OTP(5+2).  There  a  P  1 1  sends  us  to  OTP(7+l) 
which  points  to  A  }  .  The  A  is  output  and  the  }  is  found,  so  we  go  back  to 

OTP  (7).  We  are  done  with  the  PI]]  so  we  move  on  to  the  }  here  also;  this 
sends  us  back  to  OTP(5).  To  find  P  1  ]]  we  look  at  OTP(5+l),  which  is  indirect 
and  jumps  us  back  to  OTP(l).  Here  we  output  the  B,  find  the  }  ,  and  return  to 
OTP(5).  TRANSYM  finds  another  }  here  and  so  exits  back  to  OTP(3).  Having 
done  the  P  2  ]] ,  we  move  on  to  the  PI]]  which  is  found  in  OTP(3+l)=  -2.  This 
jumps  us  to  OTP(2)  where  we  output  C  and  return  to  OTP(3).  We  run  into  the  } 
and  exit  to  the  controlling  program. 

The  reader  should  thoroughly  understand  the  above  before  proceeding.  It  con¬ 
tains  the  essence  of  the  operation  of  TRANSYM  as  well  as  the  details  of  OTP. 

When  TRANSYM  is  told  to  output  the  definition  pointed  to  directly  by  OTP(k)  and 
somewhere  in  that  definiton  Pn  is  encountered,  TRANSYM  calls  itself  recursively 
to  output  the  definition  pointed  to  by  OTP(k+n),  first  jumping  to  the  direct  link 
pointed  to  by  OTP(k+n)  if  it  is  indirect. 


6.,  DIAGRAM 

Before  outlining  the  operation  of  DIAGRAM,  a  few  comments  are  in  order  since 
it  generally  operates  quite  differently  from  the  program  given  by  Irons  in  ACM 
(January  1961  Communications).  The  most  important  change  is  the  lookahead  fea¬ 
ture,  which  for  problems  of  useful  size  and  complexity  results  in  an  increase  in 
speed  by  a  factor  of  from  100  to  500;  the  greater  the  complexity  the  greater  the  in¬ 
crease  in  speed.  Two  other  changes  have  resulted  in  about  a  40  %  decrease  in  the 
size  of  OTP. 

DIAGRAM  is  a  function  of  seven  arguments:  i,  goal,  param,  error,  S2,  S^,  S^. 
STABS(i)  is  the  unit  in  STABS  being  examined.  The  goal  is  the  syntactical  unit 
which  is  being  constructed.  Param  refers  to  the  output  cell  of  the  level  which  called 
DIAGRAM  to  construct  the  goal.  Error  is  where  DIAGRAM  will  return  control  if  it 
cannot  construct  the  goal.  S2  tells  whether  or  not  lookahead  has  already  been  per¬ 
formed  or  not.  Sj  and  S^  control  what  is  done  with  the  output. 

The  first  thing  done  by  DIAGRAM  is  to  save  information  it  will  need  when  it 
exits.  Since  it  is  a  recursive  function  these  items  are  put  into  a  push  down  list. 

The  quantities  saved  are:  all  the  arguments;  j  ,  the  input  string  pointer;  k,  the 
OTP  pointer.  In  addition,  space  is  left  for  storage  in  PDL  (push  down  list)  of  SW 
and  S^,  and  the  i  for  the  last  call  is  saved  so  that  it  can  be  restored  upon  exit. 

This  eliminates  the  storage  of  i  every  time  it  is  changed,  since  this  is  not  needed 
unless  another  call  is  made.  The  quantities  i,  j,  and  k  dre  saved  in  I,  J,  and  K, 
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respectively.  Each  section  of  DIAGRAM  is  given  a  name  for  reference  purposes. 
The  action  of  each  section  is  explained  in  the  following  word  flow  chart: 

DIAGRAM  (i,  GOAL,  PARAM,  ERROR,  S2,  S3,  S^) 

DIAGR:  i  — ^  I,  j  — ^  J,  k  — ^  K,  save  i  of  last  call  in  PAST  I.  Save 
GOAL,  PARAM,  ERROR,  S2,  S3,  and  S3. 

START:  Test  STABS(i+l).  If  it  contains  (  then  STABS(i)  is  a  subject,  so 
jump  to  STRTl.  Otherwise  test  S^  to  see  if  branching  is  possible: 

F  -  no,  T  -  yes.  If  F  then  jump  to  STRT8.  If  T  then  continue  to 
the  lookahead  section  after  saving  i  in  I^.  The  lookahead  section  is 
from  STRT3  through  STRT9. 

STRT3:  Test  STC(i+l);  if  non-zero  jump  to  STRT9. 

Test  STABS  (i+2);  if  it  does  not  contain  (  then  replace  i  with  i+1 
and  go  back  to  STRT3. 

STRT4:  At  this  point  the  following  is  known 

a)  STABS(I^)  is  not  a  subject. 

b)  STABS{l2+l)  is  a  subject  (where  I2  is  the  final  value  of  i  from 
the  above  loop). 

c)  STC(i)  =  0  for  branching  is 

possible  once  we  have  passed  STABS(I^). 

Thus  we  can  test  STABS(l2+l),  the  subject,  to  see  if  it  is  the  goal  or  if  it  can  lead 
to  the  goal,  that  is,  we  see  if  STABS(l2+l)  =  GOAL  or  SUCCR  (STABS(l2+l),  GOAL) 
is  true.  If  either  is  the  case,  we  set  to  F  and  go  on  to  STRT9.  Otherwise, 
we  reset  i  to  I3  and  go  to  STRTl,  since  it  would  be  useless  to  pursue  this  branch 
further. 

STRT9:  Reset  i  to  I,. 

STRT8:  When  we  arrive  here  we  prepare  to  start  construction  of  STABS(i) 

from  the  input  since  we  know  that  either  no  branching  is  possible  and 
we  definitely  want  to  get  to  the  subject,  or  else  branching  is  possible 
(STC(i)f^O  for  some  i  between  I3  and  the  subject)  and  hence  it  is 
too  early  to  make  a  decision.  Certain  decisions  are  made  about  pos¬ 
sible  future  output  relating  to  STABS(i)  at  this  point.  S3  is  tested. 

If  it  is  F  then  we  have  previously  noted  that  no  output  for  this  unit  is 
needed,  so  S^,  an  output  indicator  for  this  level,  is  also  set  to  F. 

If  S3  is  T  then  T(i),  which  is  a  quantity  stored  in  STABS(i)  along 
with  the  syntactical  unit,  is  tested.  If  T(i)=l  S^  is  set  to  F.  If 
T(i)=2  we  have  hit  a  special  function  to  be  performed  immediately, 
followed  by  a  jump  to  CONTN.  This  is  a  special  feature  for  debugging 
aids  and  two-pass  systems.  In  this  case  STABS(i)  is  the  name  of  a 
function  rather  than  a  syntactical  unit.  S^  is  also  set  to  F  when 
T(i)=2.  If  T(i)=^l  or  2  is  set  to  T  and  we  go  on  to  SMALJ. 
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SMALJ:  j  is  replaced  with  j+1  in  preparation  for  looking  at  the  next  input. 

INPUT(j)  is  obtained  and  compared  with  STABS(i).  If  they  are  equal 
the  input  unit  itself  is  the  syntactical  unit  we  desired,  so  SW  is  set 
to  T.  If  not,  SW  is  set  to  F.  In  any  case,  we  need  to  know  if  we 
can  build  STABS(i)  from  the  string  of  characters  starting  with 
INPUT(j),  for  even  if  SW=T  we  may  be  able  to  build  a  bigger  STABS(i) 
than  just  INPUTlj)  alone.  Thus  SUCCR  (INPUT(j),  STABS(i)  )  is 
performed.  If  it  yields  F  (false)  we  skip  to  NOGO,  if  T  we  go  on 
to  STRT2. 

STRT2:  At  this  point  we  have  the  first  character  of  STABS(i),  so  we  call 

DIAGRAM  (TRAN(INPUT(j)  ),  STABS(i),  OTCEL,  NOGO,  T,  S^.  S^) 
that  is,  we  call  DIAGRAM  to  proceed  with  the  construction  with 
STABS(i)  as  goal,  the  output  cell  of  this  level  as  param,  NOGO  as  the 
error  exit,  is  true  (since  we  know  nothing  about  the  branching  for 
the  new  sentence  we  are  going  to  be  working  on),  is  the  Sg  for 
this  level,  and  is  obtained  as  follows:  the  value  of  is  F  if 

either  Sg  =  F  or  T(i)  =  3,  T  if  Sg  =  T  and  T(i)  3.  Sg  and  Sg 
pertain  to  the  output  in  a  manner  illustrated  in  a  subsequent  example 
and  will  not  be  dealt  with  here.  If  DIAGRAM  is  successful  it  returns 
to  CONTN,  where  we  go  on  the  STABS(i+l). 

NOGO:  We  arrive  here  when  we  cannot  build  STABS(i)  starting  with  INPUT  (j). 
SW  is  tested;  if  SW=T,  then  INPUT (j)=STABS(i)  and  we  go  to  CONTN. 

If  SW=F,  then  we  must  abandon  this  branch.  This  is  done  at  RETRA. 

RETRA:  J  — ^  j,  K  — k;  that  is, reset  the  input  and  output  indexes  to  the 
values  they  had  at  the  beginning  of  this  call.  We  have  discovered  that 
STABS(i)  could  not  be  constructed,  so  we  reset  j  and  k  and  proceed 
to  STRTl. 

STRTl:  STC(i)  - — ^  i,  and  test  i.  That  is,  we  replace  i  with  STC(i)  and 
then  test  this  new  value  of  i.  If  it  is  non-zero  branching  is  possible, 
so  set  S2  to  T  and  go  back  to  START  to  try  to  construct  the  al¬ 
ternate  syntactical  element  STABS(i).  If  i  is  0 ,  reset  i  to  I,  the 
original  value  of  i,  and  go  to  NEWST. 

(NEWST):  When  we  arrive  here  we  know  that  nothing  can  be  constructed.  If 
STABS(I)  was  a  subject,  or  could  not  be  built,  we  looked  at 
STABS  (STC  (STC(. . .  STC(I).  . . )))  until  no  more  branches  were  left. 

Now  we  loop  back  through  the  same  branches,  but  this  time  we  look 
for  subjects  instead  of  components  to  be  constructed.  For  every  sub¬ 
ject  we  attempt  to  start  the  construction  corresponding  to  a  sentence 
starting  with  that  subject.  We  proceed  as  follows: 
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NEWST;  Is  STABS(i+l)  =  {  ?  If  not,  we  jump  to  NPTHl.  Otherwise  STABS(i) 
is  a  subject,  and  we  put  STC(i+l)  into  OTCEL,  the  output  cell.  This 
is  the  index  pointing  to  the  definition  in  the  STABD  table  which  cor¬ 
responds  to  the  sentence  just  assembled.  Next  set  to  T  and 
test  STABS(i)  to  see  if  it  is  the  goal  we  are  seeking.  If  STABS(i) 
GOAL  we  set  SW  to  F  and  test  is  F  then  we  have 

already  performed  the  SUCCR  of  NEWS4  and  found  it  true  when  we 
were  doing  the  lookahead,  so  skip  to  NEWS3.  If  S2  is  T,  we  go  to 
NEWS4.  If  STABSd)  =  GOAL,  SW  is  set  to  T  and  -K  is  put  into 
PARAM,  which  is  the  output  cell  of  the  level  which  called  for  the 
construction  of  the  current  goal.  Proceed  to  NEWS4. 

NEWS4;  Perform  SUCCR  (STABSd),  GOAL).  If  no,  jump  to  NOPTH;  if  yes, 
proceed  to  NEWS3. 

NEWS3:  SUCCR  was  true,  so  we  can  build  the  goal  starting  with  STABS(i). 

If  STABS(i)  =  GOAL  this  means  the  construction  of  GOAL  is  recur¬ 
sive.  We  perform 

DIAGRAM  {TRAN  (STABS(i)),  GOAL,  PARAM,  NOPTH,  T,  S3,  T)  to  try  to 
proceed  with  the  construction  of  the  goal.  If  this  call  is  successful, 
go  to  FOUND;  otherwise  to  NOPTH. 

NOPTH:  GOAL  cannot  be  constructed  starting  from  STABS{i),  so  we  test  SW 
to  see  if  STABS(i)  =  GOAL  (thus  SW  saves  us  from  having  to  per¬ 
form  this  test  twice).  If  SW  is  true,  go  to  FOUND;  if  false,  go  to 
NPTHl. 

NPTHl:  Replace  i  with  STC(i)  and  test  If  the  new  value  of  i  is  non-zero, 
go  to  NEWST.  If  it  is  zero,  reset  j  to  J,  k  to  K,  and  go  to 
ERROR,  the  error  exit  for  this  call. 

CONTN:  STABS(i)  has  been  found,  so  we  call  DIAGRAM  to  continue  to 
STABS(i+l).  So  we  perform 
DIAGRAM  (i+1,  GOAL,  PARAM,  RETRA,  S2,  Sj.T) 

The  error  exit  is  to  RETRA,  the  normal  return  is  to  FOUND. 

FOUND:  If  S3,  S^,  and  Sg  are  all  true,  OTCEL  is  put  into  OUTPUT(k), 

which  is  the  k*'—  element  of  OTP,  and  k  is  replaced  with  k+1.  If 
S3  or  S_j  is  false,  no  output  is  put  out  and  k  is  not  changed .  If  Sg 
is  false  OTCEL  is  put  into  PARAM  and  k  is  not  changed.  After  any 
of  these  possibilities  a  normal  return  is  executed. 


7  TRANSYM 


Like  DIAGRAM,  TRANSYM  is  a  recursive  function,  although  it  generally  does 
not  use  nearly  as  many  recursions.  Its  input  is  OTP  and  its  output  is  a  string 
which  is  generally  fed  character  by  character  directly  into  the  output  converter. 

A  line  is  printed,  card  is  punched,  etc.,  as  soon  as  enough  characters  are  gener¬ 
ated,  thus  preventing  a  build-up  of  characters  waiting  to  be  output.  Some  of  the 
characters  are  treated  differently  from  others  in  that  they  cause  special  actions. 
Example.s  of  these  are:  output  current  card,  skip  to  Column  8,  etc.  These  are' 
dealt  with  in  the  appendixes. 

The  arguments  of  TRANSYM  are  N,  RR,  PUTIKER,  and  P.  RR  is  the  address 
of  a  direct  link  in  OTP,  STABD(N)  is  the  first  definition  unit  TRANSYM  is  to  look 
at  on  this  call,  and  PUTIKER  is  a  Boolean  variable  controlling  the  output.  If 
PUTIKER  is  T,  the  output  goes  to  the  output  converter;  if  F,  the  output  is  put 
into  OTPX,  a  temporary  storage  area  for  holding  output  to  be  later  transferred  to 
the  substitution  table  or  output  which  serves  as  the  arguments  for  the  special  func¬ 
tions  F  .  P  is  described  below, 
n 

TRANSYM  reserves  certain  information  in  a  stack  (i.  e. ,  push-down  space) 
just  as  DIAGRAM  does.  These  are  the  following  nine  quantities:  N,  RR,  PUTIKER, 
EXIT,  VKEEP,  TE,  SUBXKEEP,  SUBLKEEP,  P,  and  a  quantity  x^  which  is 
really  part  of  F^  and  F^  and  will  be  discussed  in  the  appendixes.  EXIT  is  ex¬ 
actly  what  the  name  implies  and  P  is  the  counter  for  the  left  and  right 
metabraces  =  (#  of  {  's)  -  (#  of  ) 's)  -  1.  The  -1  appears  since  P=0  at  the 
start  of  a  definition  and  we  start  with  the  character  following  the  initial  (  . 

SUBXKEEP  and  SUBLKEEP  (hereafter  referred  to  as  S  and  U  )  are  the  ad- 

o  o 

dresses  of  the  last-used  spaces  in  the  substitution  list  and  the  index  to  that  list, 
respectively.  TE  is  a  cell  used  for  the  saving  of  (essentially)  the  n  found  after 
a  P  or  F.  VKEEP  is  used  for  remembering  the  value  V,  the  OTPX  index,  for 
later  use. 

TRANSYM  initially  sets  t  equal  to  the  value  N  and  goes  to  PI. 

PI:  A  test  is  made  of  STABD(t).  If  the  character  is  a  (  we  go  to 

WASLB, 

if  a  }  we  go  to  WASRB, 

if  a  ^  we  go  to  WASPR, 

if  a  P  we  go  to  WASSP, 

if  an  F  we  go  to  WASSF, 

if  a  J  we  go  to  RETRN 

if  a  ]]  we  go  to  RETRN,  and  if  the  character  is  none  of  these  (it 

is  called  a  'simple  symbol'),  we  go  to  SIMSM. 

SIMSM:  OUTSUB(t)  is  performed;  t+1  replaces  t  and  N,  and  we  go  back 
to  PI. 
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OUTSUB(tl:  This  subroutine  checks  the  substitution  indexing  list  S(u)  to 

see  if  the  character  STABD(t)  occurs;  if  it  does,  the  appropriate 
substitution  is  made.  The  mechanism  is  as  follows: 

OUTSB:  U  is  set  equal  to  (from  SUBLKEEP). 

OTS3:  Test  u;  if  u=0  output  STABD(t)  and  return  to  calling  routine. 

Otherwise  compare  STABD(t)  with  S{u);  if  not  equal,  decrease  u 
by  1  and  go  back  to  OTS3.  If  equal,  then  there  is  a  substitution  to 
be  made.  The  list  A,  which  runs  parallel  to  S,  contains  the  address 
of  the  first  character  of  the  substitution  string. 

OTS2:  We  set  s=A(u).  s  is  the  variable  running  through  the  substitution 

strings  which  are  stored  in  area  B. 

OTS4:  B{s)  is  tested  to  see  if  it  is  the  character  ENDSYM  (octal  777  is  re¬ 

served  for  this  special  character).  If  it  is,  an  exit  to  the  calling 
routine  is  performed.  If  not,  then  B(s)  is  output,  s+1  replaces  s, 
and  we  go  back  to  OTS4. 

OUTPUT(x)  :  This  subroutine  tests  the  current  value  of  PUTIKER.  If  T,  x 
is  fed  to  the  converter.  If  F,  x  is  put  into  OTPX(v)  and  v+1  re¬ 
places  V, 

There  is  also  a  special  subroutine  OTPTN,  which  is  the  same  as 
OUTPUT  except  that  it  assumes  the  value  F  for  PUTIKER. 

WASLB:  Increase  P  by  1;  output  a  {  ;  replace  t  and  N  with  t+1; 

return  to  PI. 

WASRB:  Test  P;  if  0  go  to  RETRN;  if  not  zero  decrease  P  by  1,  output 
a  )  ;  replace  t  and  N  with  t+1,  and  return  to  PI. 

RETRN:  We  have  come  to  the  end  of  this  call  of  TRANSYM,  so  an  exit  is  per¬ 
formed. 

WASPR:  Proceed  to  count  the  number  of  primes,  q  is  set  equal  to  1,  and 
t+1  replaces  t.  STABD(t)  is  tested;  if  it  is  a  '  ,  q  and  t  are 
increased  by  one  and  the  process  repeated.  When  a  character  other 
than  a  prime  is  finally  encountered,  q  is  compared  with  P.  If  q=P, 
STABD(t)  is  tested.  If  it  is  a  P  or  F  a  jump  is  performed  to 
WSSPI  or  WSSFI,  respectively;  otherwise,  an  error  is  noted.  If 
q  P,  then  q  primes  are  output,  and  we  go  to  WSPR4. 

WSPR4;  The  P  (or  F)  is  output  along  with  alt  other  characters  up  to  and  in¬ 
cluding  the  J  which  matches  the  P  (or  F).  t  is  increased  by  one 
with  every  character  that  is  output.  The  final  value  of  t,  then,  is 
such  that  STABD(t)  is  the  character  after  the  ]].  N  is  set  to  this 
final  value  of  t  and  control  is  transferred  back  to  PI. 
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WASSP:  We  come  here  knowing  that  STABD(t)  =  &  and  that  there  are  no 

primes  before  it.  So  P  is  tested.  If  P  0,  transfer  to  WSPR4. 

If  P  =  0,  go  on  to  WSSPl. 

WSSPl:  The  value  of  the  octal  integer  n  following  is  obtained,  added  to 
RR,  and  the  sura  stored  in  TE.  This  is  the  address  in  OTP  of  the 
link  pointing  to  the  definition  of  CP^.  The  link  may  be  either  direct 
or  indirect;  this  will  be  examined  later,  t  is  increased  to  point  to 
the  character  after  n. 

WSSP3:  STABD(t)  is  examined.  If  STABD(t)  =1,  control  is  transferred  to 

DOP;  if  STABD(t)  =  ’  we  proceed;  otherwise,  an  error  condition  is 
noted.  At  this  point  VKEEP  (hereafter  referred  to  as  V^)  is  set 
equal  to  v,  the  current  value  of  the  OTPX  index.  Then  t  is  in¬ 
creased  by  one  and  STABD(t),  the  character  for  which  the  substitution 
is  to  be  made,  is  put  into  OTPX(v)  and  v+1  replaces  v  (this  is  done 
through  the  OTPTN  (STABD(t)  )  subroutine).  Again  t  is  increased, 
and  STABD(t)  is  checked  to  see  that  it  is  the  character  —  .  If  it  is, 
t  is  increased  by  one  and  we  proceed;  if  not,  an  error  condition  is 
noted.  At  this  point  STABD(t)  is  the  first  character  of  the  string 
which  is  to  be  substituted  for  the  character  in  OTPX(V^).  Now  this 
string  is  arbitrary;  that  is  to  say,  it  may  be  null  (so  that  STABD(t)  =  J 
or  J),  or  it  may  be  simple  characters,  but  it  may  contain  iP's  and 
9^'s  which  will  need  further  translation.  For  this  reason  it  is 
necessary  to  have  TRANSYM  analyze  the  string.  This  is  done  by 
performing  TRANSYM  (t,  RR,  F,  P),  which  starts  translating  with 
STABD(t)  and  continues  until  it  gets  to  either  (  or  ]]  ,  whereupon  it 
returns  to  the  present  call.  All  output  from  this  translation  is  put 
into  OTPX  starting  at  V^+1  since  PUTIKER  =  F.  Note  that  all  sub¬ 
stitutions  which  are  to  be  made  as  a  result  of  previous  calls  are  done 
during  this  translation,  so  that  the  string  in  OTPX  is  in  need  of  no 
further  analysis .  OTPTN  (ENDSYM)  is  performed,  thus  putting  the 
character  ENDSYM  at  the  end  of  the  string  in  OTPX.  We  have  now  to 
transfer  this  string  from  OTPX  to  the  area  B  and  put  the  proper 
entries  into  S  and  A.  This  is  done  as  follows: 

a. )  Set  v^  equal  to  the  current  value  of  v, 

set  v^  equal  to  v^+1,  and 
set  s  equal  to  s^  . 

b. )  Increase  s  by  one. 

c. )  Put  OTPX(v2)  in  B(s). 

d. )  Increase  by  one  and  test.  If  it  is  less  than  v^  ,  go  back  to  (b); 

if  =  v^  ,  go  to  (e). 
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e. )  Replace  U  with  U  +1. 

^00 

f. )  Put  OTPX  (v^)  in  S  (U  and  pul  in  A  (U^). 

g. )  Replace  with  S. 

Thus  we  see  that  and  always  point  to  the  last  used  cells  in  S  and  A, 
and  B,  respectively,  while  v  points  to  the  next  vacant  cell  in  OTPX. 

Since  we  are  now  done  with  the  string  in  OTPX,  v  is  reset  to  the  value  v^, 
and  a  transfer  to  WSSP3  is  executed.  Note  that  t  is  left  pointing  to  the  J  or  J 
following  the  substitution  string  analyzed  by  the  TRANSYM  call  mentioned  above. 

OOP:  N  is  set  equal  to  t+1,  which  points  to  the  character  following  the  J. 

This  is  for  resetting  t  after  the  next  call  of  TRANSYM.  We  are  now 
ready  to  get  the  defini:ion  corresponding  to  (P To  do  this  we  first 
examine  OTP(TE).  II’  this  is  indirect  (i.e.,  negative)  then  we  set 
TE  =  |OTP(TE)  I .  If  it  is  direct  we  leave  it  alone.  OTP(TE)  now 
points  directly  to  the  first  character  in  the  definition  for  (P Thus 
we  perform  TRANSYM  (OTP(TE),  TE,  PUTIKER,  0),  which  gets  the 
appropriate  output  and  puts  it  in  OTPX  or  gives  it  to  the  output  con¬ 
verter,  according  to  the  current  value  for  PUTIKER.  After  per¬ 
forming  this  we  go  to  PI2. 

WASSF:  At  this  point  it  is  known  that  STABD(t)  and  that  no  primes 

immediately  precede  this  Therefore  P  is  tested.  If  P  0, 
transfer  to  WSPR4.  If  P  =  0 ,  go  on  to  WSSFl. 

WSSFl:  The  value  of  n  is  obtained,  added  to  FUNBASE,  and  the  sum  stored 

in  TE.  This  is  the  address  of  the  first  instruction  for  ^  ,  and 

n 

control  will  be  transferred  there  after  the  arguments  (if  there  are 
any)  have  been  translated.  We  set  v^  to  the  current  value  of  v  and 
go  to  WSSF2. 

WSSF2:  Test  STABD(t).  If  it  is  L  go  to  DOF.  F  it  is  )  goon;  otherwise 
indicate  error,  t  is  increased  by  one  arid  TRANSYM  (t,  RR,  F,  P) 
is  called.  This  translates  the  argument  string  and  puts  the  output  in 
OTPX,  then  returns  with  I  pointing  to  the  )  or  ]]  at  the  end  of  the 
argument.  OTPTN  (ENDSYM)  is  performed,  thus  placing  ENDSYM 
at  the  end  of  tlie  completely  formed  argument  in  OTPX.  Control  is 
then  transferred  back  to  the  beginning  of  WSSF2. 

DOF:  V  is  reset  to  v  ,  N  is  set  to  l+l,  and  a  transfer  to  the  address  in 

o 

TE  is  executed,  v  is  in  the  accumulator  (or  some  other  known 
o 

place),  and  tliis  tells  where  the  arguments  begin  in  OTPX.  5",., 
further  knows  that  ENDSYM's  separate  the  arguments.  returns 
to  PI2. 

Set  t  =  N  and  go  to  PI. 


PI2: 


Appendix  1 


SPECIFIC  INSTRUCTIONS  FOR  OPERATION  OF  PRESENT  SYSTEM 

The  present  system  is  not  in  a  form  which  is  considered  useful  for  general 
programmers.  Special  functions  and  modifications  for  input-output  control  must  be 
put  in  as  mod-cards  using  the  SOS  system.  This  is  difficult  for  users  who  do  not 
know  the  IBM  7090  very  well.  Several  methods  for  making  the  system  easier  to  use 
are  presently  being  studied,  and  results  of  these  studies  will  be  supplied  later.  This 
paper  describes  only  that  which  is  currently  in  use. 

The  general  procedure  for  running  is 

a)  a  definition  1  call. 

b)  a  syntax  generator  call. 

c)  a  definition  2  call. 

d)  a  diagram  call  (this  calls  transym  if  it  is  successful,  otherwise  comes 
back  to  the  directing  program). 

The  directing  program,  which  has  been  named  SEER  for  no  particular  reason, 
recognizes  the  following  control  cards: 

DEFNl 

SNTXG 

DEFN2 

DIAGR,  GOAL,  PUNCH, 

DIAGR,  GOAL,  PRINT, 

DONE 

DUMP 

LOGl 

LOG2 

LOGS 

Each  begins  in  card  column  8.  The  only  variable  in  the  above  is  GOAL,  for 
which  is  substituted  the  name  of  the  actual  syntactical  element  to  be  found  by  dia¬ 
gram.  We  will  study  the  action  of  the  programs  called  when  these  cards  are 
encountered. 

DEFNl 

The  routine  DEFNl  sets  up  a  1-1  correspondence  between  the  names  to  be 
used  for  syntactical  elements  and  the  nine-bit  binary  numbers  used  by  the  machine 
to  refer  to  those  elements.  These  names  will  be  used  only  in  the  SNTXG  part  for 
inputting  the  descriptions,  except  for  the  elements  which  occur  as  goals  on  the 
DIAGR  call  cards. 
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The  input  to  DEFNl  starts  on  the  card  following  the  call  card.  A  name  is  read 
in  and  assigned  the  number  currently  in  a  counter;  then  the  counter  is  advanced 
one  and  the  next  name  input,  etc.  A  name  may  consist  of  any  number  of  characters, 
although  only  the  first  six  are  regarded  as  significant.  A  name  is  terminated  by  a 
comma,  and  blanks  following  the  comma  and  preceding  the  next  name  are  ignored. 
Column  72  of  one  card  is  regarded  as  contiguous  with  column  1  of  the  next.  While 
=,  +,  — ,  and  /  are  themselves  ligitimate  names,  no  name  of  more  than  one 
character  may  begin  with  one  of  them.  However,  they  may  occur  within  a  name. 

Of  course  a  comma  "  may  not  occur  anywhere  within  a  name.  The  reasons 
for  the  +,  — ,  and  /  being  excluded  from  the  first  character  of  a  name  will  be 
evident  from  SNTXG,  but  the  =  character  has  a  special  significance  in  DEFNl, 
for  it  is  used  to  set  the  counter.  It  may  be  followed  by  one  of  two  characters,  O 
or  D.  This  letter  is  in  turn  followed  by  from  one  to  three  digits.  If  the  letter  was 
an  O  the  digits  are  regarded  as  octal,  if  a  D,  as  decimal.  The  digits  are  fol¬ 
lowed  by  a  comma.  Thus  =O0,  =0177,  =D10,  =D511,  set  the  counter  to  000,  177, 
012,  and  777,  respectively  (all  octal).  Note  that  511  is  the  largest  number  which 
can  follow  =D,  and  777  that  can  follow  =0. 

DEFNl  is  ended  by  putting  the  character  string  DFIEND  in  as  the  last  name. 
When  DEFNl  finds  this  name  it  immediately  returns  control  to  SEER. 

SNTXG 

The  problems  that  arose  in  reading  in  the  syntax  description  were  mostly  the 
result  of  the  small  character  set  available.  This  was  the  case  particularly  in  the 
definition  part  of  the  descriptions.  The  best  way  to  explain  the  method  used  is  to 
give  an  example.  Consider  the  following  syntax  statements: 


symbol 

=  sum  = 

:  :  stmnt 

{  P^  ]]STO  P^  E  1®  ) 

symbol 

X  term  = 

:  :  term 

==^  {  P^  E  XCA  11  FMP  P3  E®  ) 

symbol 

X  symbol  = 

:  :  term 

{  LDQ  P^  E  1®  RMP  P^  E 

ffi) 

symbol 

=  :  :  sum  = 

{  CLA  IP^ 

E  IE  ) 

(sum) 

=  :  :  term=" 

->  {  P2  11  ) 

term 

X  term  = 

:  ;  term 

=^{  P3  E  STO  T  IE  P^  )  T.^ 

-^1  E  E 

XCA  IE  FMP  TIE  ) 

term 

X  symbol  = 

:  :  term  = 

{  Pj  BXCA  IE  FMP  P^  El®  ) 

term 

=  :  :  sum  = 

=^{  P^j 

where  IK  stands  for  a  carriage  return  (or  end  of  line)  signal  to  the  output  con¬ 
verter.  Writing  this  so  that  the  STC  links  are  clearly  evident  we  get 
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symbol  = 

stmnt  = 

\ 

V 

{  term  =  :  : 

1 

term  = 

symbol  =  :  : 

term  = 

{ 

=  :  :  sum  ==^  1  .  .  . 


(  sum  ) 
term 


X 

Y 


:  term 


term 

symbol 


sum= 


{  .  . 


term 

term 


where  the  definitions  have  been  omitted  for  clarity. 

The  input  corresponding  to  these  statements  is 
SYMBOL,  =,  SUM,  =  STMNT,  (/PI/,  STO/P3/,  /  *776) 

,  X,  TERM,  =  TERM,  (/PI/,  XCA/*776FMP/P3/,  /  *776) 

,  ,  SYMBOL,  =  TERM,  (LDQ/P3/, /*776FMP/P1/, /*776) 

,  =  SUM,  (CLA/Pl/, /*776) 

(,SUM,  ),  =  TERM,  (/P2/,) 

TERM,  X,  TERM,  =  TERM,  (/P3/,  STO  T/ *776/P  | ,  T/ . /F  | /, /,  XCA 


/*776FMP  T/*776) 

,  ,  SYMBOL,  =  TERM,  (/P3/,  XCA/* 776FMP  /P  | /,  / *776) 

,=  SUM,  (/P|/,) 

Note  that  consecutive  syntactical  elements  are  separated  by  commas,  and  that 
blanks  between  commas  and  between  a  comma  and  the  name  following  it  are  ignored. 
Note  that  the  commas  specify  where  an  STC  link  is  to  fall,  since  it  points  to  the 
first  name  in  a  description  (i.e.  ,  the  first  thing  not  blanks  or  commas)  from  the 
corresponding  element  in  one  of  the  preceding  descriptions.  This  corresponding 
element  is  found  by  going  upward  until  wc  find  a  non-null  name  which  is  preceded 
by  the  same  number  of  commas.  If  the  name  is  preceded  by  a  -,  T  (i)  is  set  to  1 
for  i  corresponding  to  where  that  element  is  put  into  STABS.  If  preceded  by  a  /, 

T  (i)  is  set  to  3.  If  an  octal  number  is  preceded  by  a  +,  the  T  (i)  is  set  to  2, 
and  the  octal  number  is  put  into  STABS;  that  is  to  say,  the  octal  number  is  not 
considered  as  a  name.  This  octal  number  corresponds  to  an  interal  function  called 
by  DIAGRAM.  The  name  of  the  subject  is  preceded  immediately  by  the  chai’acter  =, 
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and  following  the  subject  the  definition  begins  with  the  character  (,  which  may  be 
preceded  by  an  arbitrary  number  of  blanks.  Starting  with  that  (,  the  characters 
are  treated  differently  than  those  before  the  definition.  The  relationship  is  as 
follows: 


/p 

is  translated  to 

F 

(750g  internally) 

/F 

II  11 

1  1 

r 

(751) 

( 

It  It 

I  I 

{ 

(754) 

) 

tl  It 

1 1 

) 

(755) 

1 

It  I  1 

1 1 

t 

(757) 

f 

I  !  It 

1 1 

i 

» 

(752) 

1, 

It  M 

1 1 

1 

(753) 

1. 

tl  tl 

1 

- 

(756) 

II  II 

1  1 

a 

(where  a  is  any 
/  (  goes  in  as 

character  except 
(,  not  {  ). 

0-7.  Thus 

/*ddd 

M  II 

1 1 

(ddd) 

(where  ddd  is  a 

three  digit  octal 

number) 

If  slash  (/)  is  followed  by  any  character  other  than  P,F,  comma,  or  *  ,  the 
slash  and  the  character  go  on  into  STABD  as  ordinary  characters.  Thus 
/Pl/,STO  /P3/./«776) 

(remember  that  the  initial  {  does  not  go  into  STABD) 

becomes:  750  001  753  062  063  046  060  750  003  753  776  755 

corresponding  to:PlBSTO  P3  DIE  } 

The  use  of  /*  ddd  is  very  important,  since  it  enables  us  to  use  characters 
100-747  and  760-776. 

The  characters  760-776  have  been  reserved  for  format  control  of  the  output. 
Of  these  only  773  to  776  have  been  assigned.  The  output  converter  recognizes 
these  and  performs  the  following  operations  accordingly: 

776  output  current  line  (card)  and  set  up  for  next. 

775  skip  to  column  2  of  card. 

774  skip  to  column  8. 

773  skip  to  column  16. 

At  present  the  output  converter  recognizes  no  other  characters  but  these.  It 
simply  tests  the  character  to  sec  that  it  is  below  100;  having  ascertained  this  the 
initial  0  is  dropped  and  the  remaining  two  digits  are  put  into  the  final  output 
string.  These  two  digit  combinations  arc  simply  the  binary  coded  decimal  repre¬ 
sentations  for  the  output  alphabet.  When  72  characters  are  obtained  they  are  out¬ 
put  as  a  line  (or  card),  except  that  776  also  causes  an  output.  If  the  output 
converter  is  fed  a  character  between  100  and  772  it  signals  an  error  condition. 
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The  end  of  the  syntax  descriptions  is  signaled  by  putting  SNTEND  anywhere  on 
a  card  after  the  last  description  card. 

DEFN2 

In  order  to  understand  the  action  of  this  routine  the  reader  must  first  learn 
more  about  the  input  converter.  Ordinarily  it  simply  takes  the  BCD  representation 
for  the  input  character  (21  for  A),  appends  an  initial  zero  so  that  it  has  the  re¬ 
quired  nine  bits  (021  for  A),  and  puts  this  in  the  input  string  for  DIAGRAM.  How¬ 
ever,  the  prime  (',  or  14  in  BCD)  causes  a  different  action.  The  input  is  read  in 
until  the  next  prime  is  reached,  and  the  characters  are  treated  as  forming  a 
symbol.  Thus  'ALPHA'  is  the  symbol  alpha,  'XI 'is  the  symbol  XI.  The  input 
converter  then  looks  for  this  symbol  (only  the  first  6  characters  if  it  has  more  than 
6  characters)  in  a  table.  When  found,  the  corresponding  number  is  put  into  the  in¬ 
put  string.  This  table  is  constructed  as  follows: 


It  is  this  table  that  DEFN2  constructs.  The  input,  which  starts  anywhere  on 
the  card  following  the  DEFN2  card,  consists  of  pairs.  S,  ddd,  S, ddd,  S,ddd,  . . .  , 
where  S  is  a  symbol  of  arbitrary  length,  and  ddd  is  a  three  digit  octal  number. 
Any  number  of  blanks  may  precede  an  S,  none  may  be  between  the  comma  and  the 
octal  number.  A  symbol  is  assigned  the  number  which  follows  it.  Thus  ALPHA, 
012,  XI,  124  yields  the  table  entries 


so  that  the  input  'XI'  =  2+ 'ALPHA'  results  in  the  input  string 
12401300202001  2. 
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DEFN2  returns  control  to  SEER  when  it  discovers  the  symbol  DF2END,  so 
these  cannot  be  used  as  the  first  six  characters  of  an  input  symbol. 

DIAGR,  GOAL.  PUNCH,  (or  DIAGR,  GOAL,  PRINT, ) 

This  card  causes  the  following  operations  to  be  performed 

(a)  The  SUCCESSOR  table  is  built  up. 

(b)  The  next  card  is  checked.  If  it  contains  only  ENDS  in  columns  8-11 
and  no  other  characters  on  the  card,  control  is  returned  to  SEER. 
Otherwise  control  goes  to  the  input  converter. 

(c)  The  input  converter  puts  the  input  into  the  input  string,  reading  in 
new  cards  as  necessary,  until  it  hits  a  card  having  ENDM  in 
columns  8-11  and  the  rest  blank.  Then  we  proceed  to  (d). 

(d)  DIAGRAM  (TRAN(INPUT  (1)  ),  GOAL,  SEERl,  EROES,  T,  T,  T) 
is  called.  This  will  result  in  the  input  being  diagrammed  and  a 
normal  return  if  everything  is  all  right.  However,  if  DIAGRAM  is 
unsuccessful  and  it  returns  to  ERORS,  an  error  message  is  printed 
out  and  we  go  back  up  to  (b).  After  a  normal  return 

TRANSYM  (D(SEERl),  (SEERl),  T,0)  is  called.  (SEERl)  is  simply 
the  number  (changed  to  plus)  put  into  SEERl  by  DIAGRAM,  and  is 
the  address  of  the  direct  link  in  OTP  which  points  to  the  definition 
TRANSYM  is  to  start  with.  D(SEERl)  is  the  index  of  the  first 
character  in  that  definition.  After  TRANSYM  ends  its  action,  re¬ 
turn  to  (b)  above. 

DONE 

This  card  returns  control  to  the  SOS  monitor. 

DUMP 

This  card  causes  a  complete  dump  of  core,  using  the  SOS  macro  instruction 
"CORE. " 

LOGl,  LOG 2,  and  LOG  3 

These  cards  are  used  to  print  comments  on  line.  LOGl  causes  the  card  it  is 
on  plus  the  next  card  to  be  printed,  then  returns  to  SEER.  LOG 2  is  the  same  ex¬ 
cept  that  following  the  second  line  it  causes  an  eject.  LOG 3  is  the  same  as  LOG 2 
except  that  following  the  eject  the  computer  executes  a  HPR  (halt  and  proceed)  in¬ 
struction.  Pushing  the  start  returns  control  to  SEER.  This  can  be  used  to  give 
instructions  to  the  operator. 
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Appendix  2 


Consider  the  following  syntactical  statements: 

A,  =  LETTER,  (A) 

B,  =  LETTER,  (B) 

C,  =  LETTER,  (C) 

0,  =  DIGIT,  (0) 

1,  =  DIGIT,  (1) 

2,  =  DIGIT,  (2) 

LETTER,  DIGIT,  =  SYMBOL,  (/P2/,P1/,) 

SYMBOL,  =,  SUM,  =  STMNT,  (/Pl/,STO/P3/, /’'‘TTe) 

,  X,  TERM,  =  TERM,(/P1/,XCA/’^776FMP/P3/, /*776) 

,  ,  SYMBOL  =  TERM,  (LDQ/P3/, /*776FMP/P1/,  / ’776) 

,+  ,  TERM,  =  SUM,  ( /P1/,FAD/P3/, /’'776) 

,  ,  SYMBOL  =  SUM,  (CLA/P3/,f776FAD/Pl/,/’‘776) 

,  SUM,(CLA/P1/,  /  ^776) 

TERM,  X,  TERM,  =  TERM,  (/P3/,  STO  T/*776/Pl,  T/ . /FI/, /,  XCA/’-'776FMP 

T/'‘--776) 

,  ,  SYMBOL,  =  TERM,  (/P3/,XCA/'''776FMP/Pl/,/’‘776) 

,  =  SUM,(/P1/,) 

SUM,  +,  TERM,  =  SUM,  (/Pl/,STO  T/*776/P3,T/. /Fl/,/,FAD  T/’‘'776) 

,  ,  SYMBOL,  =  SUM,(/P3/,FAD/P1/, /’‘776 

(,SUM,),  =  TERM,(/P2/,) 

The  function  t ^  generates  a  new  symbol  each  time  it  is  called.  For  ex¬ 
ample,  it  could  generate  zOOOl,  z0002,  z0003,  etc.  for  consecutive  calls. 

The  input  we  will  consider  is: 

123  45  6  789  10  11  2345678  9  20  1234  25 
B2  =  A1  +  (BO  X  (  AO  +C2)  )  +  B1  X  A2. 

The  initial  call  of  DIAGRAM  would  be 
DIAGRAM!  TRAN(INPUT(1)  ),  STMNT,  Oq,  ER0RS,TRUE,  TRUE) 

The  tables  generated  for  this  syntax  are  the  following,  with  STABS  and  STABD 
combined.  Note  that  this  means  that  when  a  direct  link  is  put  into  an  OTCEL,  it 
will  be  i  +  2  instead  of  STC  (i+1). 
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STC(i)  and  STAB(i) 


In  the  following,  which  is  primarily  a  STAB  table,  almost  all  the  STC  entries 
are  zero.  For  this  reason  the  nonzero  entries  have  simply  been  enclosed  in  paren¬ 
thesis  and  placed  after  the  corresponding  STAB  entry. 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

00 

- 

letter 

{  (3) 

A 

} 

letter 

{  (7) 

B 

} 

Letter 

10 

{(11) 

C 

} 

digit 

{(15) 

0 

} 

digit 

{(19) 

1 

20 

} 

digit 

{  (23) 

2 

) 

digit 

symbol 

{(28) 

P 

2 

30 

H 

P 

1 

B 

} 

=(51) 

sum 

stmnt 

{(39) 

P 

40 

1 

I 

S 

T 

0 

A 

p 

3 

B 

ffi 

50 

} 

x(91) 

term(7 1) 

term 

{(55) 

P 

1 

B 

X 

C 
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OutDut  from 

Transvm 

LDQ 

B1 

FMP  . 

A2 

STO 

T 

CLA 

AO 

FAD 

C2 

XCA 

FMP 

BO 

FAD 

A1 

FAD 

T 

STO 

B2 

Several  things  should  be  noted  at  this  point. 

a)  OTP  (x)  for  x  =  11,  16,  21,  26,  35,  41,  44,  and  50  contain  no  in¬ 
formation,  since  these  outputs  were  caused  by  recognition  of  +'s,  x's,  and  )'s  , 
This  is  a  special  case  of  the  following  general  problem.  Consider  the  syntactical 
statement: 


“4^  “3'  “2'  “l'"  ^  ^  *®’l^  ^"2'  ®’3’  *"4  ^  ^ 

where  G  stands  for  the  fact  that  the  definition  for  a  is  in  general  a  function  of 
a^,  a^,  and  .  Any  output  for  will  have  been  determined  before  this 
statement  is  examined.  It  will  always  have  a  direct  link  put  in  OTP  for  it  if  it  is 
necessary,  nothing  otherwise.  So  the  following  remarks  apply  only  to 
and  .  Suppose  the  definition  for  a  does  not  depend  on  that  of  in  any  way, 
where  is  either  1,  2,  or  3.  Then  it  is  desirable  to  waste  no  OTP  space  on 
these  definitions.  This  is  precisely  what  is  accomplished  by  setting  T{i)  =  1  for 
this  particular  element.  If  is  a  simple  character  (such  as  +)  which  is  simply 
recognized  directly  from  the  input,  no  entry  is  put  in  OTP  by  the  call  of  DIAGRAM 
which  does  this  recognizing.  E  is  not  p  simple  character,  then  the  parameters 
and  are  used  to  carry  the  "no  output"  information  to  the  calls  of 
DIAGRAM  which  construct  .  This  results  in  no  indirect  entry  being  put  into 
OTP  for  ,  and  further  no  entries  are  put  into  OTP  for  any  output  which  would 
normally  result  during  the  building  of  .  This  normally  results  in  a  25%  to 
30%  saving  in  OTP  space  used,  depending  on  the  syntax.  At  present  a  —  is 
put  in  front  of  a  syntactical  unit  for  which  no  OTP  is  needed.  Thus  if  G  does  not 
depend  on  the  definition  of  we  write  a^,  ^3) } 

where  now  P^  will  get  the  definition  of  , 

P2  will  get  the  definition  of  , 

and  P^  will  get  the  definition  of  . 
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Note  that  it  is  which  is  eliminated,  and  that  the  definitions  referred  to  by 

and  are  changed.  A  brief  consideration  of  the  construction  of  OTP  will 
show  why  this  is  true. 

b)  We  see  that  it  is  not  necessary  that  OTP(x)  for  x  =  7,  13,  23,  37, 

46,  and  52  be  indirect.  We  could  just  as  well  replace  these  negative  values  of 
OTP  with  the  direct  values  to  which  they  point,  and  eliminate  the  latter,  since  they 
are  no  longer  used.  Thus  OTP  (7)  would  be  changed  to  23  and  OTP(5)  would  be 
deleted.  In  general  this  change  can  be  made  for  all  indirect  links  which  point  to 
direct  links  which  point  to  simple  definitions,  i.e. ,  definitions  which  contain  no 
P's  .  In  our  example  we  see  that  this  situation  occurs  for  "digit"  in  the  sentence 

LETTER,  DIGIT,  =  SYMBOL,  (... 

The  process  described  above  is  carried  out  by  the  sections  o*'  DIAGRAM 
which  set  up  and  test  Sj  and  .  When  it  is  known  that  a  syntactical  element 
will  normally  result  in  the  chain  described  above,  a  slash  is  written  before  that 
element,  and  this  in  turn  causes  T(i)  to  be  set  equal  to  3  for  that  element. 

This  device  usually  results  in  a  15%  to  20%  saving  in  OTP  space.  Combined 
with  the  elimination  device  described  in  part  a)  above,  a  saving  of  40%  to  50%  has 
been  obtained  for  OTP  space.  Since  this  is  by  far  the  largest  table  used,  these 
changes  were  felt  to  be  of  great  significance.  Indeed,  we  have  been  able  to  handle 
input  strings  roughly  twice  as  long  as  those  translated  before  the  new  developments 
were  added. 

The  changes  in  the  syntax  input  which  make  T(i)  =  1  and  3  in  the  appro¬ 
priate  places  are  these: 

LETTER,  /DIGIT,  =  SYMBOL, 

SYMBOL,  -  SUM,  =  STMNT, 

,  -  X,  TERM,  =  TERM, 

,  ,  SYMBOL,  =  TERM, 

,  -  +,  TERM,=  SUM, 

,  ,  SYMBOL,  =  SUM, 

,  =  SUM, 

TERM,  -  X,  TERM,  =  TERM, 

,  ,  SYMBOL,  =  TERM, 

,  =  .SUM, 

SUM,  -  +,TERM,=  SUM, 

,  ,  SYMBOL.  =  SUM, 

{  ,  SUM,  -),=  TERM, 


with  appropriate  changes  in  tlic  definitions. 
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This  revised  syntax  (along  with  A,  =  LETTER,  etc.)  results  in  the  following 
OTP  list: 


1 

28 

20 

23 

2 

23 

21 

3 

3 

11 

22 

74 

4 

110 

23 

-19 

5 

-1 

24 

28 

6 

28 

25 

19 

7 

15 

26 

7 

8 

3 

27 

195 

9 

239 

28 

-22 

10 

-4 

29 

95 

11 

187 

30 

-17 

12 

55 

31 

28 

13 

-9 

32 

19 

14 

28 

33 

3 

15 

15 

34 

39 

16 

7 

35 

-27 

17 

239 

36 

28 

18 

-11 

37 

23 

19 

28 

38 

7 

This  results  in  a  28%  saving  even  in  this  simple  example. 

c)  In  following  the  diagramming  in  the  example,  the  reader  should  have 
noticed  that  twice  DIAGRAM  wasted  a  considerable  amount  of  time  constructing 
something  it  later  could  not  use.  It  tried  to  make  a  term  starting  with  C2  and 
again  with  A2.  These  false  starts  can  be  corrected  by  changing  the  order  in  which 
units  are  constructed  or  by  changing  the  syntax.  In  general  it  can  be  said  that,  in 
order  to  produce  fast  scanning  of  the  input,  a  great  deal  of  care  should  be  exer¬ 
cised  in  putting  together  the  syntax  tables.  This  is  not  generally  critical,  however. 
The  syntax  which  corrects  the  false  starts  for  our  example  is: 

A,  =  LETTER,  (A) 

B,  =  LETTER,  (B) 

C, =  LETTER,  (C) 

0,  =  DIGIT,  (0) 

1,  =  DIGIT,  (1) 

2,  =  DIGIT,  (2) 

LETTER, /DIGIT,  =  SYMBOL,  (/P2/,/Pl/,) 


SYMBOL, 


,  -=,  SUM,  =  STMNT,  (/Pl/,STO 

,  -  X, SYMBOL,  =  TERM,  (LDQ /P2/, /*776FMP  /PI/, /’>‘776) 

,  ,  TERM,  =  TERM,  (/PI/,  XCA/’'‘776FMP  /P2/,/*776) 

,  -+,SYMBOL,  -+,=  YSUM,(FAD  /P2/, /’■•776FAD  /Pl/,/*776) 

,  ,  ,  -x,=  NSUM,  (FMP  /PI/, FAD  /P2/, /*776) 

,  ,  ,=SUM,(CLA  /P2/,/*776FAD  /P1/,/*776) 

,  ,  TERM,=  SUM(/P1/,FAD  /P2/, /*776) 

,=  SUM,(CLA  /Pl/,/*776) 

TERM, -X,  SYMBOL,  =  TERM,  (/P2/,  XCA/*776FMP  /P1/,/*776) 

,  ,TERM,=  TERM,(/P2/,ST0  T/*776/P1,  T/ . /FI/, /, XCA/*776 

FMP  T/*776) 

,=  SUM,  (/PI/,) 

SUM, -+, SYMBOL, -+,=  YSUM,  (FAD  /PI/, /*776STO  T/*776, /P2,T/. /FI/, /, 

FAD  T/*776) 

,  ,  ,-x,=  NSUM,(FMP  /Pl/,STO  T/*776/P2,T/. /Fl/,/,FAD  T/’‘'776) 

,  ,  ,  =  SUM,(/P2/,FAD  /Pl/,/*776) 

,  ,TERM,  =  SUM,(/Pl/,STO  T/='776/P2,T/. /FI/, /,FAD  T/*776) 

(,SUM,-),=  TERM,(/P1/,) 

YSUM,  SYMBOL, -+,  =  YSUM,  (FAD  /PI/, /<‘776/P2/,) 

,  , -X,  =  NSUM,  (FMP  /Pl/,/*776/P2/,) 

,  ,=  SUM,(CLA  /PI/, /*776/P2/,  ) 

,TERM,=  SUM,  (/PI/,  /P2/,) 

NSUM,  SYMBOL,  -x,  =  NSUM,  (FMP  /PI/,  /*776XCA/*776/P2/, ) 

,  ,=  TERM,  (LDQ  /PI /, /*776/P2/, ) 

,TERM,=  TERM,  (/P1/,XCA  /*776/P2/,) 

For  our  sample  problem  this  generates 


LDQ 

A2 

FMP 

B1 

STO 

T 

CLA 

AO 

FAD 

C2 

XCA 

FMP 

BO 

FAD 

A1 

FAD 

T 

STO 

B2 

whteh  is  almost  identical  to  the  previous  result. 
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Appendix  3 

TABLES  USED  BY  DIAGRAM  AND  TRANSYM 

STABS 

This  table  contains  the  syntax  used  by  DIAGRAM  to  diagram  the  input  string. 
The  IBM  7090  word  is  a  36 -bit  word  divided  into  4  groups;  these  are  (starting  at 
the  left)  a  3-bit  prefix,  a  15-bit  decrement,  a  3-bit  tag,  and  a  15-bit  address. 

Using  this  notation  and  defining  the  base  location  of  STABS  to  be  STABS^^  , 

STAB  [  i  ]  =  address  of  (STABS^  +  i),  STC  [  i  ]  =  decrement  of  (STABSq  +  i),  and 
T  [  i  ]  =  tag  of  (STABSq  +  i).  If  STAB  [i]  is  the  subject  of  a  sentence,  then  the 
word  in  STABS^  +  i  +  1  will  have  a  1  in  the  first  bit  of  the  prefix  (this  is  also  the 
sign  bit,  so  we  could  say  that  (STABSq+  i  +  1)  is  negative),  and  the  decrement 
will  contain  the  location  in  the  table  STABD  of  the  definition  corresponding  to  that 
sentence, 

STABD 

This  table  contains  the  definitions  used  by  TRANSYM  to  translate  the  output 
string  OTP,  which  is  put  out  by  DIAGRAM.  This  table  has  been  squeezed  down  so 
that  four  9-bit  characters  go  in  each  word.  However,  each  definition  starts  in  a 
new  word,  the  last  word  of  the  previous  definition  being  filled  in  with  O's  if 
necessary. 

Thus  the  sentences 

LETTER,  LETTER,  =  LTRGRP,  (/P2/,/Pl/,) 

,=  LTRGRP,  (/PI/.) 

with  LETTER  represented  internally  as  105,  LTRGRP  as  653,  P  (i.e.  "/P") 
as  750.  Ki.e.  ''/,")as  753,  {  (i.  e.  "(")  as  754,  and  }  (i.e.")")  as  755, 

would  appear  as  STABS^ 

STABS  +  a  0  0+  3  0  00105 

o 

+  a  +  1  0  00000  0  00653 

+  a  +  2  4  0  00000 

+0+3  0  0000  0  00653 

+  0+  44  /3  +  2  0  00000 


STABD 

o 

STABD  +  /I  7  50  002  753  750 
o 

+  /3  +  1  001  753  755  000 
+  /3  +  2  750  001  753  755 
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TRANT 

This  table  (400g  =  256  words)  can  be  thought  of  as  indexing  the  STABS  table. 
In  the  current  usage  both  the  decrement  and  address  of  the  words  in  TRANT  are 
used. 

The  decrement  of  (TRANT^  +  7  )  contains  TRAN  [27],  where 
STAB  [  TRAN  [27]]  is  the  second  element  of  the  sentence  which  started  with  the 
syntax  element  2  7  ,  and  the  address  of  (TRANT^+  7  )  contains  TRAN  [27+1]. 

Using  the  previous  example  again,  since  105g  =  =  (2)  (34)  +  1,  the 

address  portion  of  (  TRANT^+  34  )  would  contain  a  . 

INDXS  and  SUCRT 

The  function  SUCCR  [  Al;  A2  ]  is  used  to  see  if  it  may  be  possible  to  form 
the  Syntactical  element  A2  from  a  section  of  the  input  string,  having  already  as¬ 
certained  that  part  (or  all)  of  this  section  is  syntactical  element  Al. 

The  n—  word  of  INDXS  (IOOOq  =  512  words)  is  of  the  form  a  ,  a  ,  b  c  , 

0  nl  ni  n  n 

where  a  ,  and  a  ,  each  take  9  bits,  b  takes  13  bits,  and  c  takes  5  bits, 
nl  n2  n  '  n 

a^^  is  the  lowest  syntactical  element  for  which  SUCCR  [n;  m]  is  true,  and  a^^ 

is  the  highest,  b^  is  the  location  in  SUCRT  of  the  word  defining  SUCCR  [  n;  m  ] 

for  a  ,  <  m  <  a  T  ,  and  c  the  bit  in  that  word  for  a  ,  +  1 .  Thus  if  we  want 
nl  n2  ’  n  nl 

to  see  what  syntactical  elements  could  be  built  from  element  10 3g  ,  we  look  in 
INDXS  could  contain,  say 

777  000  0  00000  meaning  103  can  lead  to  nothing  (since  it  is  our  policy 
never  to  use  777  for  a  syntactical  element),  or 

104  104  0  00000  meaning  103  can  lead  only  to  element  104,  or 

063  105  0  00672  meaning  103  can  lead  to  063  and  105,  and  that 

b  =  000  000  000  1101,  or  15„  =  13,.,  and  c  =  11010,  =  32.  =  26,.  ,  so 
n  2  8  10  n  2  8  10 

that  in  words  SUCRT^  +  13^^  and  SUCRT^  +  14^^  we  might  have  (starting  at  the 
26—  bit  of  SUCRT^  +  13  with  the  sign  bit  considered  as  bit  0  ) 

SUCRT  Q+  13 -  0  0  1  1  00  1  1  1  0 

SUCRTq+  14  1101111 - 

meaning  that  103  can  lead  also  to  066,  067,  072,  073,  074,  076,  077,  101, 
102,  103,  and  104,  but  not  to  064,  065,  070,  071,  075,  or  100. 

INPUT  STRING  and  the  table  INSYM 

Each  input  character  is  allowed  9  bits,  and  four  characters  are  put  in  each 
word  of  the  input  space.  Each  character  on  a  card  is  read  in  its  BCD  represen¬ 
tation  (  A  is  21g,  $  is  53g,  *  is  54g,  /  is  6lg,  +  is  20g  ,  and  a  blank  is  60g), 
and  a  0  is  added  to  the  left  (A  is  021,  $  is  05  3,  etc. )  for  all  characters  except 
a  '.  Following  a  '  the  characters  up  to  and  including  the  next  '  are  read  in. 
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saving  the  first  six.  Then  these  six  characters  (or  less  if  there  were  less  than 
six  between  the  '  ,  s)  are  compared  with  the  symbols  in  the  table  INSYM.  This 
table  consists  of  pairs  of  words,  the  first  containing  a  symbol,  the  second  a  9-bjt 
representation  for  this  symbol.  If  the  input  symbol  matches  a  symbol  in  INSYM, 
then  the  corresponding  9  bits  are  put  into  the  input  string,  otherwise  an  error 
stop  occurs. 

Thus,  if  the  symbols  IF  and  GO  TO  are  assigned  to  304  and  457  respec¬ 
tively,  the  card  'IF'  (A  =  B1*C)  'GO  TO'  357  is  put  into  the  input  string  (as¬ 
suming  this  is  the  first  card)  as 

000  304  074  021  |  013  022  001  054  |  457  003  005  007  |  060  060  ... 

Note  that  000  is  put  in  for  INPUT  (0),  since  INPUT  (1)  is  the  first  input  looked 
at  by  DIAGRAM. 

OTP 

This  table  contains  the  output  from  DIAGRAM,  which  is  also  the  input  to 
TRANSYM.  Output  (i)  occupies  the  left  half  of  word  OTP^  +  ^/2  if  i  is  even, 
and  the  right  half  of  word  OTPq  +  if  i  is  odd.  The  table  is  4096  words 
long,  which  allows  for  8191  outputs  from  DIAGRAM. 

SUBL  and  ISUBIX 

ISUBIX  is  the  substitution  table  used  by  TRANSYM,  and  SUBL  is  the  indexing 
table  to  ISUBIX.  When  TRANSYM  is  to  output  a  simple  character,  the  SUBL  table 
is  scanned  (from  latest  entry  back  to  first  entry)  to  see  if  it  contains  that  character. 
If  not,  that  character  is  put  out  as  is;  if  it  is,  then  a  link  is  obtained  which  gives 
the  location  in  ISUBIX  of  the  string  to  be  substituted  for  that  character.  The  phys¬ 
ical  structure  of  SUBL  is  as  follows:  the  character  is  in  the  address  portion  of  a 
word,  with  the  link  in  the  decrement.  The  substitution  strings  are  four  9-bit 
characters  per  word  with  each  new  string  starting  in  a  new  word.  For  example, 
suppose  that  TRANSYM  is  looking  at  the  character  172,  and  in  SUBL  we  have  the 
word 

000  107  000  172 

And  in  ISUBIX  +  107g  we  had  (say) 

C  L  A  A 

ISUBIX  +  107.  023  043  021  060 

o 

S 

+  llOg  062  777  000  000 

Then  the  output  would  be  CLA  S  . 

The  character  777  signals  TRANSYM  that  that  is  the  end  of  the  substitution 
string. 
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OTPX  String 

This  area,  which  is  constructed  and  addressed  exactly  the  same  as  the  INPUT 
string,  serves  as  a  temporary  storage  space  used  when  TRANSYM  is  constructing 
a  substitution  string,  which  will  later  be  stored  in  ISUBIX,  or  a  parameter  string 
to  be  used  by  one  of  the  functions  ^  . 
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Appendix  4 


AVAILABLE  FUNCTIONS  OF  TRANSYM 

The  functions  have  been  specified  only  for  n  =  1,  2,  and  5,  which  have 
been  patterned  after  similar  functions  used  by  Irons.  All  other  values  of  n  are 
at  the  user's  disposal. 

This  is  a  function  of  either  one  or  no  variables.  It  generates  a  new  symbol 
everytime  it  is  called.  The  symbols  are  of  the  form  zlOOO,  zlOOOlj . . . , 
z9999.  The  first  time  it  is  used  in  a  definition  it  must  have  a  decimal  num¬ 
ber  as  an  argument,  this  being  the  total  number  of  times  occurs  in  the 
definition.  The  subroutine  takes  the  current  value  of  a,  a  constant  at  its 
disposal  which  is  initially  set  at  1000,  and  stores  this  in  ,  a  space  in  the 
TRANSYM  push  down  space  reserved  for  this.  Then  a  is  increased  by  the 
value  of  the  argument  of  Then  the  symbol  za^  is  output  and  is 

increased  by  one.  Thereafter  calls  of  (which  now  must  have  no  arga- 

ment)  only  perform  the  last  part  of  the  above  operation,  that  is  outputting 
za  and  increasing  a  ^  by  one . 

!5’2  This  is  a  function  of  one  argument,  a  decimal  number.  Call  this  number  /3  , 
and  set  Y  =  a  -  ^  .  Then  y,  outputs  zy  and  exits.  Thus  we  see 
that  '  I  D  outputs  the  same  symbol  as  output  during  its  last 

call  on  the  same  level  of  TRANSYM;  J  2  J  outputs  the  same  symbol 

as  did  the  time  before  last,  etc. 

5^  has  two  arguments,  the  first  being  a  string  of  characters.  This  string 

may  contain  any  number  of  characters.  J’g  simply  counts  how  many  times 
the  character  I  (031)  occurs,  adds  this  to  the  second  argument,  and  outputs 
this  as.  a  decimal  digit  string. 

&  I  A  I /  732  Oil]  91)  thus  has  12  as  its  output. 

In  all  the  above  cases  the  output  is  put- out  under  control  of  the  call  of 
TRANSYM  which  called  it.  Thus  if  PUTIKER  is  T,  the  output  converter;  if  F, 
it  goes  to  OTPX. 
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